<210> 1

<211> 3252 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> memapsin 2



<400> 1 

gcgggagtgc tgcctgccca cggcacccag cacggcatcc ggctgcccct gcgcagcggc 60 
ctggggggcg cccccctggg gctgcggctg ccccgggaga ccgacgaaga gcccgaggag 120
cccggccgga ggggcagctt tgtggagatg gtggacaacc tgaggggcaa gtcggggcag 180
ggctactacg tggagatgac cgtgggcagc cccccgcaga cgctcaacat cctggtggat 240
acaggcagca gtaactttgc agtgggtgct gccccccacc ccttcctgca tcgctactac 300
cagaggcagc tgtccagcac ataccgggac ctccggaagg gtgtgtatgt gccctacacc 360
cagggcaagt gggaagggga gctgggcacc gacctggtaa gcatccccca tggccccaac 420
gtcactgtgc gtgccaacat tgctgccatc actgaatcag acaagttctt catcaacggc 480
tccaactggg aaggcatcct ggggctggcc tatgctgaga ttgccaggcc tgacgactcc 540
ctggagcctt tctttgactc tctggtaaag cagacccacg ttcccaacct cttctccctg 600 
cagctttgtg gtgctggctt ccccctcaac cagtctgaag tgctggcctc tgtcggaggg 660 
agcatgatca ttggaggtat cgaccactcg ctgtacacag gcagtctctg gtatacaccc 720
atccggcggg agtggtatta tgaggtgatc attgtgcggg tggagatcaa tggacaggat 780
ctgaaaatgg actgcaagga gtacaactat gacaagagca ttgtggacag tggcaccacc 840
aaccttcgtt tgcccaagaa agtgtttgaa gctgcagtca aatccatcaa ggcagcctcc 900
tccacggaga agttccctga tggtttctgg ctaggagagc agctggtgtg ctggcaagca 960
ggcaccaccc cttggaacat tttcccagtc atctcactct acctaatggg tgaggttacc 1020
aaccagtcct tccgcatcac catccttccg cagcaatacc tgcggccagt ggaagatgtg 1080
gccacgtccc aagacgactg ttacaagttt gccatctcac agtcatccac gggcactgtt 1140
atgggagctg ttatcatgga gggcttctac gttgtctttg atcgggcccg aaaacgaatt 1200
ggctttgctg tcagcgcttg ccatgtgcac gatgagttca ggacggcagc ggtggaaggc 1260
ccttttgtca ccttggacat ggaagactgt ggctacaaca ttccacagac agatgagtca 1320
accctcatga ccatagccta tgtcatggct gccatctgcg ccctcttcat gctgccactc 1380
tgcctcatgg tgtgtcagtg gcgctgcctc cgctgcctgc gccagcagca tgatgacttt 1440
gctgatgaca tctccctgct gaagtgagga ggcccatggg cagaagatag agattcccct 1500
ggaccacacc tccgtggttc actttggtca caagtaggag acacagatgg cacctgtggc 1560
cagagcacct caggaccctc cccacccacc aaatgcctct gccttgatgg agaaggaaaa 1620
ggctggcaag gtgggttcca gggactgtac ctgtaggaaa cagaaaagag aagaaagaag 1680
cactctgctg gcgggaatac tcttggtcac ctcaaattta agtcgggaaa ttctgctgct 1740
tgaaacttca gccctgaacc tttgtccacc attcctttaa attctccaac ccaaagtatt 1800
cttcttttct tagtttcaga agtactggca tcacacgcag gttaccttgg cgtgtgtccc 1860
tgtggtaccc tggcagagaa gagaccaagc ttgtttccct gctggccaaa gtcagtagga 1920
gaggatgcac agtttgctat ttgctttaga gacagggact gtataaacaa gcctaacatt 1980
ggtgcaaaga ttgcctcttg aattaaaaaa aaactagatt gactatttat acaaatgggg 2040
gcggctggaa agaggagaag gagagggagt acaaagacag ggaatagtgg gatcaaagct 2100
aggaaaggca gaaacacaac cactcaccag tcctagtttt agacctcatc tccaagatag 2160
catcccatct cagaagatgg gtgttgtttt caatgttttc ttttctgtgg ttgcagcctg 2220
accaaaagtg agatgggaag ggcttatcta gccaaagagc tcttttttag ctctcttaaa 2280
tgaagtgccc actaagaagt tccacttaac acatgaattt ctgccatatt aatttcattg 2340
tctctatctg aaccaccctt tattctacat atgataggca gcactgaaat atcctaaccc 2400
cctaagctcc aggtgccctg tgggagagca actggactat agcagggctg ggctctgtct 2460
tcctggtcat aggctcactc tttcccccaa atcttcctct ggagctttgc agccaaggtg 2520
ctaaaaggaa taggtaggag acctcttcta tctaatcctt aaaagcataa tgttgaacat 2580
tcattcaaca gctgatgccc tataacccct gcctggattt cttcctatta ggctataaga 2640
agtagcaaga tctttacata attcagagtg gtttcattgc cttcctaccc tctctaatgg 2700
cccctccatt tatttgacta aagcatcrca cagtggcact agcattatac caagagtatg 2760
agaaatacag tgctttatgg ctctaacatt actgccttca gtatcaaggc tgcctggaga 2820
aaggatggca gcctcagggc ttccttatgt cctccaccac aagagctcct tgatgaaggt 2880
catctttttc ccctatcctg ttcttcccct ccccgctcct aatggtacgt gggtacccag 2940
gctggttctt gggctaggta gtggggacca agttcattac ctccctatca gttctagcat 3000
agtaaactac ggtaccagtg ttagtgggaa gagctgggtt ttcctagtat acccactgca 3060
tcctactcct acctggtcaa cccgctgctt ccaggtatgg gacctgctaa gtgtggaatt 3120
acctgataag ggagagggaa atacaaggag ggcctctggt gttcctggcc tcagccagct 3180
gcccmcaagc cataaaccaa taaamcaaga atactgagtc taaaaaaaaa aaaaaaaaaa 3240
aaaaaaaaaa aa 3252
<210> 2

<211> 488 

<212> PRT 

<213> Homo sapiens 
<220> 

<223> purified memapsin 2, aspartic proteinase 2 (ASP2) 
<220> 

<223> amino acids 28-48 are remnant putative propeptide 
residues 
<220>

<223> amino acids 54-57, 61-68, 73-80, 86-89, 109-111, 
113-118, 123-134, 143-154, 165-168, 198-202, and 
220-224 are N-lobe beta strands 

<220> 

<223> amino acids 58-61, 78, 80, 82-83, 116, 118-121, 
156, 166, 174, 246, 274, 276, 278-281, 283, and 
376-377 are residues in contact with the OM99-2 
inhibitor 

<220> 

<223> amino acids 184-191 and 210-217 are N-lobe helices 
<220> 

<223> amino acids 237-240, 247-249, 251-256, 259-260, 
273-275, 282-285, 316-318, 331-336, 342-348, 
354-357, 366-370, 372-375, 380-383, 390-395, 
400-405, and 418-420 are C-lobe beta strands 

<220> 

<223> amino acids 286-299, 307-310, 350-353, 384-387, 
and 427-431 are C-lobe helices 

<400> 2 

Ala Gly Val Leu Pro Ala His Gly Thr Gln His Gly Ile Arg Leu Pro
1 5 10 15

Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg
20 25 30

Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val
35 40 45

Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gln Gly Tyr Tyr Val
50 55 60

Glu Met Thr Val Gly Ser Pro Pro Gln Thr Leu Asn Ile Leu Val Asp
65 70 75 80

Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu
85 90 95

His Arg Tyr Tyr Gln Arg Gln Leu Ser Ser Thr Tyr Arg Asp Leu Arg
100 105 110

Lys Gly Val Tyr Val Pro Tyr Thr Gln Gly Lys Trp Glu Gly Glu Leu
115 120 125

Gly Thr Asp Leu Val Ser Ile Pro His Gly Pro Asn Val Thr Val Arg
130 135 140

Ala Asn Ile Ala Ala Ile Thr Glu Ser Asp Lys Phe Phe Ile Asn Gly
145 150 155 160

Ser Asn Trp Glu Gly Ile Leu Gly Leu Ala Tyr Ala Glu Ile Ala Arg
165 170 175

Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gln Thr
180 185 190

His Val Pro Asn Leu Phe Ser Leu Gln Leu Cys Gly Ala Gly Phe Pro
195 200 205

Leu Asn Gln Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met Ile Ile
210 215 220

Gly Gly Ile Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro
225 230 235 240

Ile Arg Arg Glu Trp Tyr Tyr Glu Val Ile Ile Val Arg Val Glu Ile
245 250 255

Asn Gly Gln Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys
260 265 270

Ser Ile Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val
275 280 285

Phe Glu Ala Ala Val Lys Ser Ile Lys Ala Ala Ser Ser Thr Glu Lys
290 295 300

Phe Pro Asp Gly Phe Trp Leu Gly Glu Gln Leu Val Cys Trp Gln Ala
305 310 315 320

Gly Thr Thr Pro Trp Asn Ile Phe Pro Val Ile Ser Leu Tyr Leu Met
325 330 335

Gly Glu Val Thr Asn Gln Ser Phe Arg Ile Thr Ile Leu Pro Gln Gln
340 345 350

Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gln Asp Asp Cys Tyr
355 360 365

Lys Phe Ala Ile Ser Gln Ser Ser Thr Gly Thr Val Met Gly Ala Val
370 375 380

Ile Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg Ile
385 390 395 400

Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala
405 410 415

Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr
420 425 430

Asn Ile Pro Gln Thr Asp Glu Ser Thr Leu Met Thr Ile Ala Tyr Val
435 440 445

Met Ala Ala Ile Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val
450 455 460

Cys Gln Trp Arg Cys Leu Arg Cys Leu Arg Gln Gln His Asp Asp Phe
465 470 475 480

Ala Asp Asp Ile Ser Leu Leu Lys
485
<210> 3

<211> 503 

<212> PRT 

<213> Homo sapiens 
<220> 

<223> pro-memapsin 2 



<220> 

<223> amino acids 1-13 are the T7 promoter 
<220> 

<223> amino acids 1-15 are vector-derived residues 



<220> 

<223> amino acids 16-64 are a putative pro-peptide 
<220> 

<223> amino acids 16-456 are pro-memapsin 2 T1
<220> 

<223> amino acids 16-421 are promemapsin 2 T2 



<400> 3 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Gly Ser Met Ala
1 5 10 15

Gly Val Leu Pro Ala His Gly Thr Gln His Gly Ile Arg Leu Pro Leu
20 25 30

Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu
35 40 45

Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu
50 55 60

Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gln Gly Tyr Tyr Val Glu
65 70 75 80

Met Thr Val Gly Ser Pro Pro Gln Thr Leu Asn Ile Leu Val Asp Thr
85 90 95

Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His
100 105 110

Arg Tyr Tyr Gln Arg Gln Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys
115 120 125

Gly Val Tyr Val Pro Tyr Thr Gln Gly Lys Trp Glu Gly Glu Leu Gly
130 135 140

Thr Asp Leu Val Ser Ile Pro His Gly Pro Asn Val Thr Val Arg Ala
145 150 155 160

Asn Ile Ala Ala Ile Thr Glu Ser Asp Lys Phe Phe Ile Asn Gly Ser
165 170 175

Asn Trp Glu Gly Ile Leu Gly Leu Ala Tyr Ala Glu Ile Ala Arg Pro
180 185 190

Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gln Thr His
195 200 205

Val Pro Asn Leu Phe Ser Leu Gln Leu Cys Gly Ala Gly Phe Pro Leu
210 215 220

Asn Gln Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met Ile Ile Gly
225 230 235 240

Gly Ile Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro Ile
245 250 255

Arg Arg Glu Trp Tyr Tyr Glu Val Ile Ile Val Arg Val Glu Ile Asn
260 265 270

Gly Gln Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser
275 280 285

Ile Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe
290 295 300

Glu Ala Ala Val Lys Ser Ile Lys Ala Ala Ser Ser Thr Glu Lys Phe
305 310 315 320

Pro Asp Gly Phe Trp Leu Gly Glu Gln Leu Val Cys Trp Gln Ala Gly
325 330 335

Thr Thr Pro Trp Asn Ile Phe Pro Val Ile Ser Leu Tyr Leu Met Gly
340 345 350

Glu Val Thr Asn Gln Ser Phe Arg Ile Thr Ile Leu Pro Gln Gln Tyr
355 360 365

Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gln Asp Asp Cys Tyr Lys
370 375 380

Phe Ala Ile Ser Gln Ser Ser Thr Gly Thr Val Met Gly Ala Val Ile
385 390 395 400

Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg Ile Gly
405 410 415

Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala
420 425 430

Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn
435 440 445

Ile Pro Gln Thr Asp Glu Ser Thr Leu Met Thr Ile Ala Tyr Val Met
450 455 460

Ala Ala Ile Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys
465 470 475 480

Gln Trp Arg Cys Leu Arg Cys Leu Arg Gln Gln His Asp Asp Phe Ala
485 490 495

Asp Asp Ile Ser Leu Leu Lys
500
<210> 4
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
peptide derived from amyloid precursor protein
(APP) beta-secretase site

<400> 4 

Ser Glu Val Lys Met Asp Ala Glu Phe Arg 
1 5 10



<210> 5 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
peptide derived from amyloid precursor protein
(APP) beta-secretase site containing the "Swedish
mutation"

<400> 5 

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg 
1 5 10 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
peptide derived from the processing site of 
presenilin 1 

<400> 6 

Ser Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 7 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
peptide derived from amyloid precursor protein
(APP) gamma-secretase site

<400> 7 

Lys Gly Gly Val Val Ile Ala Thr Val Ile Val Lys
1 5 10
<210> 8
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : signature 
aspartic protease sequence motif 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa = Thr or Ser 

<400> 8 
Asp Xaa Gly 
1 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
peptide PS1-gamma in vivo beta-processing site
human presenilin 1, human memapsin 2 cleavage
substrate

<400> 9 

Leu Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 10 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR 
amplification primer NHASPF1 

<400> 10 

ggtaagcatc ccccatggcc ccaacgtc 28



<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 
amplification primer NHASPR1 

<400> 11 

gacgttgggg ccatggggga tgcttacc 28
<210> 12

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR 
amplification primer NHASPF2 

<400> 12 

acgttgtctt tgatcgggcc cgaaaacgaa ttgg 34 



<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR 
amplification primer NHASPR2 

<400> 13 

ccaattcgtt ttcgggcccg atcaaagaca acg 33 



<210> 14 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR
amplification adapter primer AP1

<400> 14 

ccatcctaat acgactcact atagggc 27 



<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR 
amplification nested adapter primer AP2 

<400> 15 

actcactata gggctcgagc ggc 23 



<210> 16 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : outside primer 
GT10FWD 
<400> 16

cttttgagca agttcagcct ggttaa 26



<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : outside primer 
GT10REV 

<400> 17 

gaggtggctt atgagtattt cttccagggt a 31 



<210> 18 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : outside primer 
GT11FWD 

<400> 18 

tggcgacgac tcctggagcc cg 22



<210> 19 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : outside primer 
GT11REV 

<400> 19 

tgacaccaga ccaactggta atgg 24 



<210> 20 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : amplification 
primer PASPN1 

<400> 20 

catatggcgg gagtgctgcc tgcccac 27



<210> 21 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence : amplification 
primer NHASPC1 

<400> 21 

ggatcctcac ttcagcaggg agatgtcatc agcaaagt 38



<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oxidized bovine 
insulin beta chain hydrolytic site #5 

<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa = cysteic acid 
<400> 22 

His Leu Xaa Gly Ser His Leu Val 
1 5 



<210> 23 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oxidized bovine 
insulin beta chain hydrolytic site #6 

<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa = cysteic acid 
<400> 23 

Xaa Gly Glu Arg Gly Phe Phe Tyr 
1 5 



<210> 24 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
peptide NCH-gamma hydrolytic site #7 

<400> 24 

Val Gly Ser Gly Val 
1 5 



<210> 25 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
peptide NCH-gamma hydrolytic site #8 

<400> 25 

Val Gly Ser Gly Val Leu Leu 
1 5 



<210> 26 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
peptide NCH-gamma hydrolytic site #9 

<400> 26 

Gly Val Leu Leu Ser Arg Lys 
1 5 



<210> 27 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
memapsin 2 inhibitor substrate analogue OM99-1

<220>

<221> MOD_RES
<222> (3)..(4)

<223> Xaa at positions 3 and 4 represent Leu and Ala
with the peptide bond substituted by a
transition-state isostere hydroxyethylene
(-CH(OH)-CH2-) group

<400> 27 

Val Asn Xaa Xaa Ala Glu Phe 
1 5 



<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : octapeptide 
upon which synthetic memapsin 2 inhibitor 
substrate analogue OM99-2 is based 
<400> 28

Glu Val Asn Leu Ala Ala Glu Phe 
1 5 



<210> 29 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: retained
structure of synthetic memapsin 2 inhibitor
substrate analogue OM99-1 with outside subsites
P4, P3, P3' and P4' discarded

<220>

<221> MOD_RES
<222> (2)..(3)

<223> Xaa at positions 2 and 3 represent Leu and Ala
with the peptide bond substituted by a
transition-state isostere hydroxyethylene
(-CH(OH)-CH2-) group

<400> 29 
Asn Xaa Xaa Ala 
1 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
peptide NCH-gamma 

<400> 30 

Val Gly Ser Gly Val Leu Leu Ser Arg Lys 
1 5 10



<210> 31 
<211> 326 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> pepsin 
<220> 

<223> amino acids 2-5, 6-9, 13-20, 25-32, 65-67, 69-74, 
79-87, 89-91, 99-106, 119-122, 150-154, 164-167, 
180-183, 191-194, 196-199, 201-204, 210-214, 
221-223, 258-262, 265-269, and 275-278 are beta 
strands 

<220> 

<223> amino acids 281-284, 286-288, 298-301, 310-315, 
and 319-324 are beta strands 
<220>

<223> amino acids 48-51, 111-114, 136-142, 225-234, 
249-254, 271-274, and 303-306 are helices 

<220> 

<223> amino acids 12-13, 30, 32, 34-35, 73-77, 111, 117, 
120, 189, 213, 215, 217-220, 287, 289, 291, 298, 
and 300 are residues in contact with pepstatin. 



<400> 31

Val Asp Glu Gln Pro Leu Glu Asn Tyr Leu Asp Met Glu Tyr Phe Gly
1 5 10 15

Thr Ile Gly Ile Gly Thr Pro Ala Gln Asp Phe Thr Val Val Phe Asp
20 25 30

Thr Gly Ser Ser Asn Leu Trp Val Pro Ser Val Tyr Cys Ser Ser Leu
35 40 45

Ala Cys Thr Asn His Asn Arg Phe Asn Pro Glu Asp Ser Ser Thr Tyr
50 55 60

Gln Ser Thr Ser Glu Thr Val Ser Ile Thr Tyr Gly Thr Gly Ser Met
65 70 75 80

Thr Gly Ile Leu Gly Tyr Asp Thr Val Gln Val Gly Gly Ile Ser Asp
85 90 95

Thr Asn Gln Ile Phe Gly Leu Ser Glu Thr Glu Pro Gly Ser Phe Leu
100 105 110

Tyr Tyr Ala Pro Phe Asp Gly Ile Leu Gly Leu Ala Tyr Pro Ser Ile
115 120 125

Ser Ser Ser Gly Ala Thr Pro Val Phe Asp Asn Ile Trp Asn Gln Gly
130 135 140

Leu Val Ser Gln Asp Leu Phe Ser Val Tyr Leu Ser Ala Asp Asp Gln
145 150 155 160

Ser Gly Ser Val Val Ile Phe Gly Gly Ile Asp Ser Ser Tyr Tyr Thr
165 170 175

Gly Ser Leu Asn Trp Val Pro Val Thr Val Glu Gly Tyr Trp Gln Ile
180 185 190

Thr Val Asp Ser Ile Thr Met Asn Gly Glu Ala Ile Ala Cys Ala Glu
195 200 205

Gly Cys Gln Ala Ile Val Asp Thr Gly Thr Ser Leu Leu Thr Gly Pro
210 215 220

Thr Ser Pro Ile Ala Asn Ile Gln Ser Asp Ile Gly Ala Ser Glu Asn
225 230 235 240

Ser Asp Gly Asp Met Val Val Ser Cys Ser Ala Ile Ser Ser Leu Pro
245 250 255

Asp Ile Val Phe Thr Ile Asn Gly Val Gln Tyr Pro Val Pro Pro Ser
260 265 270

Ala Tyr Ile Leu Gln Ser Glu Gly Ser Cys Ile Ser Gly Phe Gln Gly
275 280 285

Met Asn Leu Pro Thr Glu Ser Gly Glu Leu Trp Ile Leu Gly Asp Val
290 295 300

Phe Ile Arg Gln Tyr Phe Thr Val Phe Asp Arg Ala Asn Asn Gln Val
305 310 315 320

Gly Leu Ala Pro Val Ala
325



<210> 32 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: beginning of
promemapsin 2 T1 and T2 construct

<400> 32

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Gly Ser Met Ala
1 5 10 15

Gly Val Leu 



<210> 33 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ending of
promemapsin 2 T1 construct

<400> 33

Gln Thr Asp Glu Ser Thr
1 5 



<210> 34 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ending of 
promemapsin 2 T2 construct 

<400> 34 

Gly Phe Ala Val Ser Ala 
1 5 



<210> 35 

<211> 8 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic
memapsin 2 inhibitor substrate analogue OM99-2

<220>

<221> MOD_RES
<222> (4)..(5)

<223> Xaa at positions 4 and 5 represent Leu and Ala
with the peptide bond substituted by a
transition-state isostere hydroxyethylene
(-CH(OH)-CH2-) group

<400> 35 

Glu Val Asn Xaa Xaa Ala Glu Phe 
1 5 



<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : pro-memapsin 2 
hydrolytic site #1 (aa 12-18 of SEQ ID NO: 3) 

<400> 36 

Arg Gly Ser Met Ala Gly Val Leu 
1 5 



<210> 37 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : pro-memapsin 2 
hydrolytic site #2 (aa 23-30 of SEQ ID NO: 3) 

<400> 37 

Gly Thr Gln His Gly Ile Arg Leu
1 5 



<210> 38 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : pro-memapsin 2 
hydrolytic site #3 (aa 98-105 of SEQ ID NO: 3) 

<400> 38 

Ser Ser Asn Phe Ala Val Gly Ala 
1 5 
<210> 39
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : pro-memapsin 2 
hydrolytic site #4 (aa 183-190 of SEQ ID NO: 3) 

<400> 39 

Gly Leu Ala Tyr Ala Glu Ile Ala
1 5 